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(57) Information stored at a resource remotely 
located across a network is automatically imported 
according to pre-defined selection criteria, formatted in 
a form usable by a local processing resource, stored in 
the formatted form, and provided to the local processing 
resource for use by it. In one embodiment, a mafling fist 
manager has an import module coupled to a browser. 
The import module receives a set of user-defined infor- 
mation selection criteria. The import module commands 
the browser to estabfish a network connection to a 
remote network resource containing desired informa- 
tion. When the connection is established, the browser 
requests information from the remote network resource 
according to the selection criteria. Information received 
from the network resource is stripped of unnecessary 
information, formatted for use by the mailing fist man- 
ager and stored in a database maintained by the mailing 
list manager. In a preferred ernbocfiment the network 
resource is a Web server with a search page that is cou- 
pled to a searchable database of addresses, and the 
addresses are usable in a mailing 1st 
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Description 

The present invention relates generally to computer systems, and more particularly relates to automatically select- 
ing and importing information from a network resource, and formatting the information for use by a local processing 
resource. 

The globally accessible computer network known as the Internet is a vast resource of information, much of which 
is avaflaWe at no direct cost. A local computer can connect to a cistant server, request a file or an image from the server, 
and receive the requested information immediately without a direct charge for the value of the information. 

One popular technology enjoying wide use with the Internet is known as the World Wide Web. The World Wide Web 
enables a computer to locate a remote server using a server name in an agreed-upon format that is indexed at a central 
Domain Name Server (DNS); the computer can then establish a connection to the server ad retrieve information using 
a communication protocol called the Hypertext Transfer Protocol (HTTP). A Uniform Resource Locator (URL) uniquely 
identfies each page of information stored on the remote server. A URL is a form of network address that identifies the 
location of information stored in a network. The focal computer requests information by providing a request containing 
a URL of the desired information to the remote server. The pages of information are files prepared in the Hypertext 
Markup Language (HTML). 

As use of the World Wide Web has soared, many different sources of free information have become available. The 
information is free in the sense that the server that provides the information makes no cSrect charge to an end user who 
requests or receives the information. 

One type of free information service isadu-ectcfyservk^ AcOrectory service is a remote server coupled to a data- 
base containing names, addresses, telephone numbers and other information about businesses and individuals 
derived from public telephone directories and other sources. Exemplary cfrectory servers include the "fourIP telephone 
directory server identified by the URL http://www.foum .com/, rtttp7/www.Bigbook.cx)rn/. http://yp.gte.net/, 
rtttpyAvww.Bigyellcw.com/. Some systems offer free access to database of incfividual address information, effectively 
operating ss free ''white pages" services. Other systems provide free access to databases of business address infor- 
mation, effectively operating as globally accessible, free "Yellow Pages" directories. 

Using any Web browser program, a user can connect to these systems, provide a query, and receive information, 
such as a set of names, addresses, and telephone numbers. The user's local computer can make an HTTP connection 
using a browser to the remote server of a directory service. The user commands the browser to retrieve a home page 
of the remote server. The user enters a query in the home page to request a set of information from the database and 
returns the query with the home page to the server. The server parses the query, interrogates the database accorcfing 
to the qu ery, receives a set of records from the database, formats th e records by placing them in a template document 
prepared in the HTML language, and delivers the completed HTML document containing the records to the local com- 
puter. 

After the information is received in an HTML dc^merrt displayed by the browser, the set of 
one HTML page at a time using the browser, cut from each HTML document pasted into a ffle prepared with a text edi- 
tor or word processor program, and manually edited into a mafling list 

Alternatively, the set of names can be cut from the HTML document stored in a text file using a word processor, 
edited to add field delimiters between values of each record, and provided as input to a maffing list management pro- 
gram. Each of these steps is manually performed, separately. The mailing fist management program reads the text file, 
separates records and values by tier^ list data- 

base. The mailing list management program can then generate mailing labels, form letters, reports, and other docu- 
ments using the maifing list database 

While directory services provide information that can be used to manually build a mailing list from free information 
available over a network, the approaches described above have several significant disadvantages. For example, man- 
ually building a mailing list using cut-and-paste ecfting operations with a word processor is tedious, time-consuming, 
and error-prone. It is desirable to provide an automatic way for a user to define desired free information, retrieve the free 
information from a remote server over a network, and format the information into a form usable by a focal processing 
resource. 

Also, directory services have the tisadvantage that considerable time is required to format information received 
from the directory service Further, additional time is needed to instruct a local processing resource how to receive the 
formatted information. Therefore, it is also desirable to provide an autwraticw^ to carry out these steps a^ 
resulting list to a local processing resource for subsequent use 

Further, as the number of Web servers hasinaeased.it has beccm mere c^k^ for a r^ersc^exrx>sed to severe 
different Web servers or URLs to keep track of the names of each Web server and its URL for later use Many URLs 
are quite long and need to be spelled with 100% accuracy to be useful. They include unusual typed characters such as 
colons, backslashes and "@" characters. These characters and other parts of the URL must be remembered and repro- 
duced correctly, or the URL cannot be used successfully to retrieve a Web page. 
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For all these reasons, there is a need for method, apparatus and products that can request and retrieve information 
from a remote source; format the information for local use; and pass the formatted information for local use. There is 
also a need for such methods, apparatus and products that can retrieve such infara in an automatic way, and auto- 
matically format the information for use by a local processing resource. 

5 The invention provides a method, apparatus, and corrputer program product for importing information from a net- 
work resource. In one embodiment a method according to the invention comprises the steps of storing selection criteria 
in a memory coupled to a local processing resource; requesting the information from the remote resource according to 
the selection criteria; receiving the information from the remote resource over the network; and providing the informa- 
tion to the local processing resource in a format usable by the local processing resource. One feature of this embodi- 

10 ment is connecting to the remote resource using an application program driven by the local processing resource. In one 
aspect, the application program is a browser, and the browser is driven using a bridge. In another aspect, the browser 
is driven by the local processing resource using OLE Automation. 

Another feature of the invention is storing, in the memory a list of remote information sources located on the net- 
work, and a prof ie describing a format of information at one of the remote information sources. In one aspect the infor- 

15 mation at the remote information source is converted from one format to a second format usable by the local processing 
resource according to the profile. In another aspect an import function of the local processing resource is invoked, and 
using the irrport function, the information is stored in the second format in a database coupled to the local processing 
resource. In yet another aspect, the profile is retrieved from a central data storage location on the network. 

Another feature of the invention is providing the information by executing a browser, and commanding the browser 

20 using the local processing resource to connect to the remote information source over the network. 

StiD another feature of the invention is storing a query in a Web page in a storage device coupled to a browser. In 
one aspect requesting the informatkxi involves srt In another 

aspect, receiving the information includes receiving an output Web page into the Web browser over the network. In yet 
another aspect providing the information involves writing a raw mailing list derived from the output Web page on the 

25 storage device, formatting the raw mafling list into a formatted list based upon a format of a database coupled to a mail- 
ing list manager, and irrporting the formatted Bst into the database of the mafling Ost manager using an irrport function 
of the maiing list manager. 

The invention also encompasses a computer apparatus and a computer program product having the foregoing fea- 
tures and aspects. 

30 The foregoing and other features, aspects and advantages of the present invention will become more apparent 
from the following detailed description of die present invention when taken in conjunction with the accompanying draw- 
ings. 

The objects, features and advantages of the system of the present invention will be apparent from the following 
description in which: 

35 

FIG. 1 A illustrates a computer of a type suitable for carrying out the invention. 
FK3.1 Bis a block diagram of the computer of FKx 1A. 

FIG. 2A is a block diagram of a software architecture of one embodiment of the invention. 
FIG. 2B is an exemplary screen display produced by one embodiment of the invention when cfisplaying a main win- 
40 dow. 

FIG. 2C is an exemplary Import Rom Internet dialog box displayed by one embodirnerTt of the invention 
FIG. 2D is an exemplary Selection Criteria dialog box displayed by one embodiment of the invention. 
FIG. 3 is a block diagram of a software architecture of an alternate embodiment of the invention. 
FIG. 4A is a flowchart of a method according to one erribodknent of the invention. 
45 FIG. 4B is a flowchart of a first portion of a method according to an alternate embodiment of the invention. 
FIG. 4C is a flowchart of a second portion of the method of FIG. 4B. 

A method, apparatus and corrputer program product for automatically importing information from a network is 
described In the following description, for the purposes of explanation, numerous specif ic details are set forth in order 
so to provide a thorough understanding of the present invention, ft will be apparent however, to one skilled in the art that 
the present invention may be practiced without these specific details. In other instances, well-known structures and 
devices are shown in block diagram form in order to avoid unnecessarily obscuring the present invention. 

The detailed descriptions that follow may be presented in terms of program procedures, routines or processes exe- 
cuted on a computer or network of computers. These procedural descriptions and representations are the means used 
55 by those skilled in the art to most effectively convey the substance of their work to others skilled in the art 

A procedure, routine, or process is generally conceived to be a serf-consistent sequence of steps leading to a 
desired result These steps require physical manipulations of physical quantities. Usually, though not necessarily, these 
quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and 
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otherwise manipulated. Principally for reasons of common usage, at times it is convenient to refer to these signals as 
bits, values, elements, symbols, characters, terms, numbers, or the like. It should be noted, however, that all of these 
and similar terms are associated with physical quantities and are merely convenient labels applied to such quantities. 
Further, the manipulations performed are often referred to in terms, such as adding or comparing, which are com- 

5 monly associated with mental operations performed by a human operator. No such capability of a human operator is 
necessary, or desirable in most cases, in any of the operations described herein that form part of the present invention; 
the operations are machine operations. Useful machines for performing the operation of the present invention include 
general-purpose digital computers or similar devices. 

The present invention also relates to apparatus for performing these operations. This apparatus may be specially 

10 constructed for the required purpose or it may comprise a general purpose computer as selectively activated or recon- 
figured by a computer program stored in the computer. The procedures presented herein are not inherently related to 
a particular computer or other apparatus, various general-purpose machines may be used with programs written in 
accordance with the teachings described in this document, or it may prove more convenient to construct more special- 
ized apparatus to perform the descrbed method steps. The required structure for a variety of these machines will 

is appear from the description given. 

FIG. 1A illustrates a computer of a type suitable for carry out the invention, viewed externally in FIG. 1 A, a com- 
puter 2 has a central processing unit 100 having tisk drives 110A and 1 10B. Disk drive indications 1 10A and 1 10B are 
merely symbolic of a number of disk drives that might be accommodated by the computer ^ Typically, these would 
include a floppy disk drive such as 11 OA, a hard disk drive (not shown externally) and a CD ROM drive indicated by slot 

20 110B. The rtumber and type of drives van 

play 120 upon which information is displayed. A keyboard 130 and a mouse 140 are typically also available as input 
devices. Preferably, the computer 2 illustrated in FIG. 1A is a personal computer. 

FK3. 1 B illustrates a block diagram of the internal hardware of the computer 2 of FK3L 1 A. A bus 150 serves as the 
main information highway interconnecting the other components of the computer. CPU 155 is the central processing 

25 unit of the system, performing calculations and logpc operations required to execute programs. Read only memory (1 60) 
and random access memory (1 65) constitute the main memory of the computer. Disk controller 1 70 interfaces one or 
more disk drives to the system bus 150. These cfsk drives may be floppy disk drives, such as 173, internal or external 
hard drives, such as 172, or CD ROM or Digital Video Disk (DVD) drives such as 171 . A display interface 125 interfaces 
a display 120 and permits information from the bus to be viewed on the display. A printer interface 152 interfaces the 

30 bus 150 to a printer 154 and permits information from the bus to be printed on the printer. 

The modem 190 is connected to a telephone network 191 in the usual manner. Communications with a network 
195 or with other external devices can occur over communications port 185. 

In one embodiment the data network 195 is coupled to a globally accessWe computer network such as the Inter- 
net 50. Alternatively, the network 1 95 is a local area network. For example, using a modem bank 190 and telecommu- 

35 nications software, the computer can call a ISP and estabfish a dial-up connection to the Internet 50. A remote Web 
server 60 is connected to the network. The remote server 60 contains source documents, such as pages formatted in 
the HTML language, or websites referenced in a source document In conjunction with a mailing fist manager program, 
a computer 2 in this arrangement can be used tor generating and managing maiOng fists and related documents. For 
example, the computer 2 may run the Pitney Bowes SrnarfJUta/J^ program. Using the SmartAfe/ter program or another 

40 mailing list manager, a user defines a database of addresses to be stored on the hard disk 172. The user enters data 
representing addresses into the database. The user also prepares one or more form documents or templates, such as 
brochures or form letters, using word processing functions of the mafling fist manager. The user queries the database 
using selection criteria that specify a range of actresses within the database, 

iriar field or market Using predefined instructions, the user can instruct the mailing list manager to create a copy of a 
45 form document for each of the selected addressees arid to insert the addressee s ato 

user can then instruct the mailing list manager to print each customized document and an envelope addressed to the 
addressee, on the printer 154. The form documents and selection criteria are stored on the hard disk 172 for subse- 
quent use. The mailing list manager also can print mailing labels, envelopes and the like according to various selection 
criteria. 

so The present invention is related to the use of computer 2 to import information from a network Accorcfing to one 
embodiment importing information from a network is performed by computer 2 in response to CPU 155 executing 
sequences of instructions contained in memory 1 60. 1 65. Such instructions may be read into the memory from another 
computer-readable medium, such as data storage devices 1 71 , 1 72, 1 73. Alter natively, the instructions are downloaded 
from the Web server 60 through the Internet 50 and the modem 1 90 or the communications port 1 85. The downloaded 

55 instructions are stored in one of the data storage devices 171, 172, 1 73 and are men toaded into the memory 160. 165. 
Execution of the sequences of instructions contained in the memory causes CP U 155 to perform the process steps that 
win be described hereafter. In alternative embodiments, hard-wired circuitry may be used in place of or in combination 
with software instructions to implement the present invention Thus, the present invention is not limited to any specific 
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combination of hardware circuitry and software. 

A URL is a network address or a reference to a remote World Wide Web server 60 or website. URLs generally are 
formatted according to the following template: 

(protocol id) JI {(server)) (second level domain) (top level domain) {(directory)}) {(file)} 
5 The (protocol id) identifies the transmission protocol to be used. For example, in the case of the Web it is (http).and 
in the case of an anonymous file transfer protocol transaction it is (fftp) The (server) element is an optional server 
name such as (www.). The (server) element may also identify a cfirectory on a storage device of the Web server 60 
that contains HTML documents. The (second level domain) element is the name of the server domain as found in the 
DNS table, such as (pitneybowes). The (top level domain) element identifies the type of the second level domain, and 
io must be an item selected from a finite set of globally recognized top level domains, such as "com." "org." "mfl," "edu," 
"gov," and others. The (directory) element is an optional name of a directory within the (server), such as 
( DocumentRoot/) . The (file) element is an optional name of a file, document or image to be retrieved, such as 
( lndex.html>. Thus, a URL serves as one type of a network address to tocate a documem anywhere in a network. 
FIG. 2A is a block diagram of a software architecture with which an embodiment of the present invention can be 
is implemented. 

An operating system (OS) 200 is installed in the central processing unit 100 of computer 2, such as on the hard 
drive 172. In conventional manner, the OS 200 is loaded by the CPU 1 55 into the main memory when the computer is, 
powered up. The OS 200 controls and supervises the operation of one or more local processing resources. In one 
embodiment, the local processing resources are application software programs, and the OS 200 facOrtates interaction 
20 of the application programs with functions of the OS 200 and of the hardware elements of the computer 2. 

In the embodiment of FIG. 2A, the application programs are a communications program 202, a Wbrfcl Wide Web 
browser 204, and a mailing list manager 206. Each of the application programs runs on the system, lor example, by 
residing in the main memory (160, 1 65) or in the data storage devices 171, 172, 173, for execution by the CPU 155. 
The cxxnrnunications program 202 enables the OS 200 and other application programs to connect to the data net- 
25 work 195 or the telephone network 191 . For example, under control of the OS 200 or an application program, the com- 
munications program 202 commands the modem 190 to take a telephone line connected to the modem 190 off-hook 
wait tor dial tone, and place a telephone call to an ISP. The communications program 202 establishes a connection with 
the ISP. When a application program writes data to the communications port 185 or to the modem 190, the communi- 
cations program 202 converts the data to a form usable by the modem or the port Exemplary commercially available 
30 communications programs are ProCOMM, and the TCIP cfialer programs generally. When Microsoft Windows95 is used 
as the OS 200, the communications pro-am 202 is integrated with the OS 200. 

The browser 204 reads a document prepared in HTML that is stored on the hard disk 172 or the Web server 60, 
interprets the HTML tags contained in the document, and displays graphics, text and other elements encoded in the 
document on the display 1 20. 

35 The mafling fist nranager 206 is an applicatwn program that aeates and list database 260. and 

prepares a maiGng Gst 262 based upon the mailing fist database 260. The mailing list manager 206 reads and writes 
the mailing fist database 260, and writes the maifing fist 262, through a generic file input/output system integrated into 
the OS 200. The maiGng fist manager 206 can add. delete, or mocfify records in the mafling fist database 260. query the 
mailing list database 260 and select records from it based upon a user-provided query in a structured query language. 

40 and prepare reports or mailing lists. An exemplary commercially available computer program that can be used as the 
mailing list manager 206 is the Pitney Bowes SmartMailer product 

The browser 204 may obtain a document from the Web server 60 by establishing an HTTP connection through the 
comrrtmications program 202 to the Web server 60. The Web server 60 runs a Web server progra 
protocol to respond to the browser 204. The Web server 60 has a storage device 66 that stores documents such as 

45 HTML format documents or Web pages. To obtain a document from the Web server 60. the browser 204 issues a 
request in the HTTP protocol, such as an HTTP GET command, with the name of the desired document as an argument 
to the command. In response, the Web server 60 searches the storage device 66 for the requested document If it is 
found, the Web server toads the document and delivers it over the HTTP oormectkxi to the browser 204. The HTTP cw 
necbon is then Disconnected. 

so The Web server 60 is coupled through a firewall 62 to a database 64. The database 64 contains data or HTML doc- 
uments for deOvery to browsers that contact the Web server 60. The firewall 62 prevents urauthorized requests for 
umerrts from reaching and possixy corrupting the database 64. 

Bridge software 210 runs in connection with the OS 200 and acts as a link between the application programs. The 
bridge software 210 enables one application pro-am to communicate with another application program through an 

55 agreed-upon, shared inter-appfication communication scheme In one embodiment the bridge software 210 provides a 
distrfouted object communication facility. An appBcation program can write data to an object, and invoke a transport rou- 
tine to cause the bridge software 210 to transport the object to another application program. The receiving application 
program extracts a message and data from the object and acts upon them. 



5 



EP 0 890 913 A1 

In a preferred emtxxf ment, the OS 200 is the Microsoft Windows95 operating system, and the bridge software 21 0 
is the Object Linking and Embedding (OLE) Automation facility integrated into the Microsoft Windows95 operating sys- 
tem. Using OLE Automation, one application program written for use with OLE Automation can control or drive the oper- 
ation of another application program that is also written for use with OLE Automation. Using OLE Automation, a first 

5 application program rs called a server component and is controlled by a second application program called an automa- 
tion client or automation controller. An automation controller is driven by a scripting language, or forms a part of an 
application program. Automation is carried out using a dynamic object method invocation facility of OLE known as c§s- 
patchaUe interfaces. Using dynamic dispatching under OLE, an automation cfient can invoke a method or manipulate 
a property of a server component by a late birvjng mechanism. At run time, the automation cfient obtains a dispatch 

w identifier from a type library associated with the server component The dispatch identifier is passed to an "invoke" 
method of OLE Automation that resolves which method of the server component to call at run time. The type Ibrary is 
created at run time by an object definition language (ODL) fie that describes interfaces of the server component The 
ODL fie is prepared by the programmer who writes the server component. 

OLE and OLE Automation are described in detail in R. Orfali et a!., The Essential Distributed Objects Survival 

is Guide; (New \brkr John Wiley & Sons. Inc., 1996). at pp. 283-296 and pp. 425-532. 

Alternatively, the bridge software 210 can be a Object Request Broker (ORB) that complies with the Common 
Object Request Broker Architecture (CORBA), or the bridge software 21 0 can be the OpenDoc system as defined by 
Apple Computer, Inc. 

In one embodiment of the invention, the mailing fist manager 206 has an Internet import module (IIM) 208, as 

20 shown in FIG. 2 A. In a preferred embodiment the IIM 208 forms an integral part of the mailing list manager 206. As 
shewn in FIG. 2B, when the mafling list manager 206 with the IIM 208 is in operation on computer 2, the IIM 208 gen- 
erates a screen display comprising a main window 270 on the display 1 20. The main window 270 has a command pane 
271 that displays commands labeled Re 272, Edit 274, View 276, Tools 278, and Help 280. The operating system 200 
displays a cursor 282 ever the main window 270; a user controls the cursor 282 by moving the mouse 1 40. When the 

25 user positions the cursor 282 over the Fde command 272 and presses a button the IIM 208 displays 

a Re pun-down menu 273. The Re pufl-down menu 273 has functions including a Open function 272a, a Close func- 
tion 272b a Save function 272c, and an Import From Internet action 272d. The Open function 272a, Close function 
272fa, and Save function 272c are examples of functions that can be provided by the IIM 208; however, they are not 
retired and other functions may be provided. 

30 When the user positions the cursor 282 over the Import From Internet function 272d and presses a button on the 
mouse 140, as shown in FIG. 2C, the IIM 208 displays an Import From Internet dialog box 282. The dialog box 282 has 
a network resource fist 284 that lists one or more network resources 284a, 284b. For example, a first network resource 
284a is named Fourtl and has a network address 286a of Vww.four1l.com." Thus, in this embodiment the network 
addresses are URLs. The cfialog box 282 has a scrofl bar 287; by positioning the cursor 282 on the scrofi bar 287, a 

as user can cause the network resource Bst 284 to cfisplay more network resources. 

The Import From Internet dialog box 282 has a Select function button 288. a Profie function button 290, and a Can- 
cel function button 292. Each of the function buttons 288. 290, 292 is associated with program code in the IIM 208 that 
carries out processing steps in response to activation of the associated function button. 

When the user positions the cursor on the Cancel function button 292 and clicks the mouse 1 40. the I IM 208 closes 

40 the dialog box 282, closes the pull-down menu shown in FIG. 2B, and discontinues further processing. 

To cause the IIM to connect to a network resource and automatically import information from it, the user positions 
the cursor on the Select function button 292 and clicks the mouse 140. In response, the IIM 208 closes the dialog box 
282, and reads a Site Proffle fie 220 associated wrth me selected ^ The Site Profaefie 220tsatable 

that has a row for each network resource, and is indexed by an integer value that uniquely identifies the network 

45 resource. Each record of the Site Profile file 220 has information that is used by the IIM 208 to connect to a network 
resource, import information from the network resource, and format the imported information for use by the mailing list 
manager 206. 

The Site Profile file 220 is coupled to a Site Index file 230. The Site index f He 230 is an index to the Site Profile file 
220. Each record in the Site Index fie 230 is a pair of values. One element in the pair is a unique index number that 

so uniquely identifies each record in the Site Prof ie file 220. The other element in the pair is a pointer to the location in the 
Site Profle f fle 220 of the record identified by the index number. In this manner, the Site Profile file 220 can be a file of 
sequential records that is maintained in a non-ordered manner, and the index file can be a random-access fOe that is 
sorted as necessary, and enables rapid access to records in the Site Profie file 220. In an alternate embodiment the 
Site Profie ffle 220 is self-indexing. 

55 In the preferred embocfiment each record in the Site Prof fle file 220 stores information in the fields shown in Table 1 . 
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TABLE 1 





SITE PROFILE RLE FORMAT 


5 


FIELD 


CONTENTS 




Site Identifier 


Integer index value 




Name 


Common name of network resource name 


10 


Home Address 


Network address or URL of the home page of the network resource 




Search Addrl 


URL of the first page in the network resource that permits a user to enter a search query 




Search Addr2 


URL of a second Dace in the network resource that oermits a user to enter a search nuerv fit 
any) 


15 


Searchable Fields 


Pointer to a list of fields in the page at Search Addrl that can be searched upon 




Booleans 


Whether boolean search queries are permitted 




Returned Fields 


Pointer to a fist of fields that are returned in response to a search query 


20 


Record Template 


Pointer to a template describing the data format of the fields that are returned in respcytse to a 
search query 




HTML Template 


Pointer to an HTML template describing the format of an HTML page that is returned by the 
neiworK resource in response to a searcn query 


25 


MaxRecords 


Integer value of the maximum number of records returned by the network resource in response 
to a search query 




Cookie Rag 


Boolean flag indicating whether the network resource needs to set cookies to respond to a 
search query 


30 


Timeout Value 


Integer value representing a number of seconds that the 1 IM 208 should wart to connect to the 
network resource; failure of the network resource to resporxi in that time is defined as a tvrieout 




Retries 


Integer value representing the maximum rtuntoer of times the IIM 20£ should attempt connec- 
tion to the network resource before flpving up 



35 Also in the preferred embodiment the Site Profle fie 220 and its associated Site Index f 0e 230 are periodically 
updated from a central data storage location. The central data storage site is a Web server known a Site Profile Update 
Server connected to the Internet 50. The Site Profile Update Server stores a current copy of the Site Profile file 220 and 
the Site Index fie 230, containing records for each known network resource. The IIM 208 has a File Update module 
associated with and aecessed by an Update Prof Des command in the File pulkfown menu 273. When the user selects 

40 the Update Profiles command, the File Update module of the IIM 208 commands the modem 190 or the communica- 
tions port 185 to contact the Site Proffle Update Server and establish an HTTP session with it Preferably, the session is 
conducted in a secure manner to prevent unauthorized access to the Site Prof ae Update Server. For example, the 
HTTP session is conducted using Netscape Navigator as the browser 204, and the Site Profie Update Server cornrrai- 
nicates with the browser 204 using the Secure Sockets Layer (SSL) protocol. In a secure way. the browser 204 sends 

45 a password to the Site Profile Update Server. In response, the Site Profile Update Server transfers a copy of the then- 
current version of the Site Profile file 220 and the Site Index file 230. As the current version of each file is received, the 
File Update module of IIM 208 instructs the OS 200 to write the current version over the copy of the file stored on the 
storage devices 171, 172, 173. In this way, current versions of site profiles are downloaded to replace existing copies 
of the proffles. 

so Afier rearing the Site Profiefile 220 associated with the selected network resource, the IIM 208 requests the user 
to enter search criteria identifying the information to be imported from the selected network resource. The type of 
search criteria entered by the user is dependent upon information in the Site Profile file for the selected network 
resource, such as the fields that can be searched, or whether Boolean queries are allowed. 

In one embodiment, the IIM 208 displays a search criteria dialog box 300 having the format shown in FIG. 2a The 
55 search criteria dialog box 300 displays fields that can be searched in the selected network resource and ways to select 
values of the fields. By checking boxes and entering values in the search criteria dalog box 300, the user can build a 
query that will be applied to the network resource to create a set of information to be imported. 

For example, the search criteria dialog box 300 has a check box 302a associated with a NAME, FIRST field 302b. 
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The field 302b also is associated with an Equal check box 302c, a Less Than check box 302d, a Greater Than check 
box 302e, and a text field 302f. The user can instruct the KM 208 to import records having NAME. FIRST values equal 
to "MICHAEL" by clicking on the check boxes 302a and 302c and typing "MICHAEL" in the text field 302f. In the example 
of FIG. 2D, the user has cficked on the ZIP check box 304a and the check box 304d, and entered "95999" in the ZIP 
5 text field 302f. These entries instruct the IIM 208 to request the network resource to import records having ZIP values 
less than "95999." 

The search criteria dialog box 300 also has a MORE check box 306. When a user clicks on the MORE checkbox 
306, the IIM 208 will save the selection criteria that has been entered, and display a blank selection criteria dialog box 
300. Selection criteria entered in subsequent selection criteria oSalog boxes are appended to previously entered selec- 
ts tbn criteria In this way, a user can build a complex query. 

The number, name, layout and arrangement of the fields, check boxes, and text fields in the search criteria dialog 
box 300 are defined by the Site Proffle fie 220. 

In an alternative embodiment the IIM 208 displays a dialog box that enables the user to enter a query in the Struc- 
tured Query Language (SQL). This embodiment is used when the network resource is a web server 60 that has a data- 
is base 64 that can accept and respond to SQL queries. 

The Search Criteria cfialog box 300 has an Import function button 308, a Clear function button 31 0, and a Cancel 
function button 312. If a user clicks on the Clear function button 310. the IIM 208 dscards any selection criteria that 
have been emered in the (fialog bra If a user clicks on the Cancel func- 

tion button 31 2, the IIM 208 closes the dialog box 300 and discontinues further processing. 
20 When the user clicks on the Import button 308, the IIM reviews and tests the selection criteria that have been 
entered to determine whether they are valid and complete. If not the IIM displays an appropriate error message and 
prompts the user to enter correct data. 

When all the selection criteria are valid and complete, the IIM 208 connects to the network resource, submits the 
selection criteria, imports a set of information described by the selection criteria from the network resource, and stores 
2S the set of information in a format that is usable by a local processing resource such as the mailing ist manager 206. 

In the preferred embodiment the IIM 208 attempts to establish an HTTP connection, through the Internet 50. to a 
Web server 60 identified by the URL value contained in the Home Address field of the Site Profile file 220. Using OLE 
Automation, the IIM 208 in the rde of cont^ rde of server to issue the HTTP GET 

command to the Web server 60, with the selected URL as an argument If the Web server 60 fails to respond within the 
30 Timeout period, or if the browser 204 reports an error to the IIM 208. the IIM raises a error condition and processing is 
discontinued. 

If the HTTP GET command is successful, the home page or initial page of the website identified by the URL is 
retrieved to a cache memory of the computer 2 and tfsplayed on thecfisplay 120. The IIM 208 assembles the search 
criteria into a query having a format that is understood by the Web server 60, according to information in the Site Profile 

35 file 220. The IIM 208 then requests a search page from the Web server 60. The search page is an HTML document of 
the Web server 60 that is identified in the Site Prof le ffle 220 as the page at which a search query is entered. 

Search pages tor retrieving information from a Web server are known in general. When a user connects the search 
page of a Web server using a browser and without using the invention disclosed herein, the user may enter text or 
numeric values in search fields of the search page. The user then presses a button labeled "Submit", or the equivalent 

40 in the search page. This causes the browser to return a copy of the search page to the Web server including the text or 
numeric values entered by the user. The Web server reads the text or numeric values and retrieves or prepares another 
Web page according to the values that were entered In some systems, the Web server queries a database based upon 
the values that were entered, and prepares a Web page containing values retrieved from the database in response to 
the query. The Web page is then sent to the browser for cBsptey. 

45 According to the invention, when the search page is returned to the browser 204 by the Web server 60 of a direc- 
tory service, the IIM 208 makes a copy of the search page in main memory, and fiDs the copy of the search page with 
values of the search criteria. The IIM 208 then commands the browser 204 to send the flled-in copy of the search page 
back to the server 60. When the server 60 receives the filed-tn copy of the search page, the server 60 reads the values 
of the search criteria and prepares a query to the database 64. The server 60 submits the query to the database 64 

so across the firewall 62. The database responds by provkfing a set of information matching the query to the server 60. 
The server 60 formats the set of information for delivery to the browser 204. In one embodiment the server 60 reads 
an HTML template from the storage device 66. makes a copy of the template, and writes the set of information into the 
copy of the template. Thus, the filed-in copy of the template is an output page from the server 60. The output page is 
delivered through the Internet 50 to the browser 204. In this way. the server 60 processes and responds to the search 

55 request received from the I IM 208 in a manner similar to that generally used for requests from browsers that do not form 
part of the invention. The server 60 does not "know" that the IIM 208 is issuing the search request Advantageously, the 
invention does not require mocff ication of an existing server of a directory service. 

When the output page is received at the browser 204, the IIM 208 reads the output page. Referring to the HTML 
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Template value for the network resource in the Site Profile file 220, the If M 208 locates the set of information in the out- 
put page. The set of information is written to a Raw List file 240. Thus, using the HTML Template value, the IIM 208 
essentially strips out unnecessary HTML information from the output page, leaving only the raw data derived from the 
database 64 in the Raw List 240. 

5 Using the Record Template value for the selected network resource, the IIM 208 reads each record from the Raw 
List file 240, applies conversion criteria read from the Record Template, and writes one or more records or fields in a 
Formatted List file 250. In this Way, the IIM 208 converts the information imported from the database 64 into a format 
that is usable by the mailing fist manager 206. 

For example, the database 64 stores the middle initial of an intfviduaf's name concatenated with the first name of 

io the individual, whereas the mailing list manager 206 stores the middle initial and first name in separate fields. In this 
example, the Record Template would descrtoe the format of each record in the database 64 as delivered in the output 
page. The IIM 208 would read each first name value from the Raw List file 40, identify the middle initial value, and write 
the first name and middle initial separated by a delimiter character in the Formatted List file 250. In the preferred embod- 
iment the mailing list manager 206 has an integral Import Wizard facility. The Import Wizard is program code that can 

75 receive from a user a description of values and records in an input file, read the input file, convert records in the input 
f3e into a format usable by the mailing list manager 206, and write values from the records to the mailing list database 
260. One known program usable as a mailing list manager that has an Import Wizard is the Pitney Bowes SmartMaffer 
product 

After writing the Formatted List file 250, the IIM 208 passes the Record Template value and the name of the For- 

20 matted List file 250 to the Import Wizard of the mailing list manager 206, and commands the maifing Est manager 206 
to import the Formatted List fie 250. In response, the Import Wizard reads the Record Template value, reads each 
record of the Formatted List file 250, and Writes values from the records into the mailing list database 260. 

Thereafter, the mailing fist manager 206 is used in conventional manner to prepare maifing fists from the imported 
information, according to desired selection criteria and to store the lists on one of the storage devices 171, 172, 173. 

25 Mailing labels or the mailing fists are printed on the printer 1 54. Customised documents for printing and mailing are gen- 
erated using the mailing list manager 206 with the imported information. The maifing list manager 206 may be used to 
modify the imported information or to select a range of records from the inported information. In this way. free informa- 
tion is imported from a network resource for use by a local processing resource, for example, in creating, managing and 
using mailing fists derived from the free information. 

30 An alternate embodiment is shown in FIG. a. In this embodiment HTTP communication functions and HTML inter- 
pretation functions of the browser 204 are provided in an HTTP/HTML module 209 that is integrated into the maifing list 
manager 206. Functions and processes performed by the browser 204 in the preceding discussion are performed by 
the HTTP/HTML module 209 in the embodiment of FIG. 3. The embodiment of FIG. 3 is advantageous in that it does 
not require the end user to use, obtain, or install a browser program. In addition, it enables the IIM 208 to be prepared 

35 with less reliance on the bridge software 210; communication between IIM 208 and the module 209 can be hard coded 
into the IIM 208 and the module 209. 

In another alternative of the emtxxfiment of FIG 2A. when the IIM 208 issues commands to the browser 204, the 
IIM 208 instructs the browser 204 not to display Web pages received from the Web server 60 on the display 120. In this 
way, from the user's perspective, actions of the browser occurs IIM 

40 208 is not cfisturbed. In yet another alternate embodiment the IIM 208 is a browser plug in that is stored in association 
with the browser 204. In this embodiment when a user loads and executes the maiGng fist manager 206. the mailing 
fist manager 206 loads the browser 204. In turn, when the browser 204 toads, rt 

to itself, and executes the IIM 208 in the manner generaly known in the art for instating and executing browser plug- 
ins. The advantage lies in improved speed performance over OLE automation techniques. 

45 In stfll another alternate embodiment the IIM 208 is an applet written in the Java language. In still a further alternate 
embodiment, the IIM 208 is an Active-X control. In each of these embodiments, when a user loads and executes the 
mailing list manager 206. the maifing fist manager 206 loads the browser 204. In turn, when the browser 204 loads, it 
loads the IIM 208, attaches the IIM 208 to itself, and executes the IIM 208 in the generally manner known in the art for 
instalfing and executing applets and Active-X controls. The use of JAVA allows constant version control through 

so updates, while Active-X allows improved speed over OLE automation. 

The invention also encompasses certain methods. In one embodiment a method of importing information from a 
network resource has the steps shown in FIG. 4A. In step 350. a user of the method creates a set of selection criteria 
for the information to be imported. In step 352, a connection is established to a resource that is remote, such as a net- 
work resource. The network resource stores a set of information from which a subset is to be imported. In step 354. 

ss information from the network resource is requested according to the selection criteria In step 356, the information is 
locally received. 

In step 358, the received information is converted to a form that is usable by a local processing resource In step 
360, the converted information is defivered to the local processing resource for use by it. In step 362, the process is 
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complete. 

An aftemateentxxJiment of a method for automatically importing information from a network resource has the 
*eps sr»*n ,n RG. 4B and RG. 4C. .n this embodiment, in step 402, a maifing list manager * run or SZZZZ 
404. a user of the process selects an information import function of the mailing list manager. In step 406. a list of known 
reS^^T^ ?" * ^ * the process is prompted to selecTone of the network 

^ J? ^ 41 °- Process tests whether the user has selected one of the network resources, and 
it not, enters a loop until a selection is made. 

VVhen the user selects a srte. the identification of the site is received in step 41 2. A site profile for the selected site 
Tr^Ht^ f L 4 ' "L^TJ 16, * B , usefis prom P <ed to enter a set of search crrteria that describe the information to be 
s*^ I" step 418. a browser capable of contacting and receiving information from the 
selected srte is invoked or run. The srte name and location, and the search criteria, are provided to the browser instep 

In step 422. the process tests whether a response has been received from the site by the browser If not control is 
f T S8d l°?j**; 4 - in which the process tests whether a predetermined timeout factor has elapsed. If not control 
returotostep422toaga.ntestwhe^ 

reports a ierror_ and prepares to discontinue processing. The process then ends in step 428. When a response is 

J^SST T**^™ teSt 01 422 * affirmative ' ^ contro1 * Passed to step 430. in which a raw mailing list 
is received from the site. 

In 6^ 432, the rawniafling list is «)nve* 

^LV°^l^ e * 68leCted 8rte « read to identify the expected format of the^w mJfngl*1nsS 442' 
uar^^ .nforrnat^ re^ f^ 

^ ™ Tfl" 9 . step 4441 (Sel]Trite ^ that identify data values in the raw mailing fist are inserted into thefet 
The re-formatted Est is written to an output f3e in step 446. inrome iisi. 

i«t ^™ 4 ^^'!^! mceon 01 fte maiUng list mana 3 er te mokoa - Usi"9 the import wizard, the mailing 
T^^^^JTl ^ in step 446. identifies discrete data values^ fte^cords of Z 

SSs^SS'S 10 8 datSbaSe fe to "* "^"^ *" ^ ,ist The 

The invefitoi also encompasses a computer program product cofigured to carry out the processing steps 
«scnbed heron. The computer program proAict is ent>c<fied in a merncry medium such a^ 
h^r^ magnefc tape, or r^ital VKto Disk 

c^n proc^^om^ for controlling the computer shown in FIG. 1 A and FK3. 1 B to enable the computer to per- 

^^^^J™**** 3 ^ in9 10 "* inVe " ti0a ^ mem0fy 3,80 informationfor 
estebtshing ,n the mam memory of the computer the software architecture of the invention shown in FIG. 2A and FK3.3 
and the alternative software architecture ernbodiments cSscussed herein. 

1 iri b^ambi*^''' """'"'''illllflj n®'® been described above in terms of URLs, the invention is not limited to the use of 
t^tiS^Tt e,t !2? e t 8 ad * e6S> *" *° temi " nelWDrt( fe Wended to refer broadly to any 

Z ^' er _lf er !? C8 8 to ****** network location or document, tn addition, the 

W^ewe^one 

wiD l ^c!!^¥'l^? 0^ • 4,18 '"^^ dGSCTtoed ^ fefefence to 6 P ecific errtodimerrts thereof. It 
ZJ^^ te*^** vanous "xxfihcatjons and changes may be made thereto without departing from the 

S^SS *^ *™ ,nVerti0n - ^ ™* are. accordingly, to be regaSedVan Blustra 

tve rather than a restrictive sense. 

Claims 

1, Am ethodofinrportir^ 

(a) storing selection criteria in a memory coqpled to a local processing resource; 

(b) revesting said Information from said remote resource according to said selection criteria; 

(c) receiving said information from said remote resource over said network; and 

i !^^ ng **** inf0rmatk>n to ^ Processing resource in a format usaWe by said local processing 



2. 



The metixxi recited in claim 1 , wherein the step of requesting said irrformation further comprises the step of- con- 
necting to said remote resource using an application program driven by said local processing resource. 
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3. The method recited in claim 1 , wherein the step of requesting said information further comprises the step of: con- 
necting to sad remote resource using a browser driven by said local processing resource using a bridge 

4. The method recited in claim 1 , wherein the step of requesting said, information further comprises the step of: con- 
necting to said remote resource using a browser driven by said local processing resource using OLE Automation. 

5- The method recited in claim 1 . further comprising the steps of: storing, in said memory, a list of a plurality of remote 
information sources located on said network and a profile describing a first format of said information at one of said 
remote information sources. 

6. The method recited in claim 5, wherein the step of storing said list further comprises the step of: retrieving current 
versions of said fist and said profile from a central data storage location on said network. 

7. The method recited in claim 5, wherein the step of providing said information comprises the further steps of. 

(a) converting said information from said first format to a second format usable by said focal processing 
resource according to said profile; 

(b) invoking an import function of said local processing resource; and 

(c) using said import function, to store said information in said second format in a database coupled to said 
focal processing resource. 

a The method recited in claim 1 , wherein said step of providing said information comprises the steps of: 

(a) executing a browser; and 

(b) commanding said browser with said local processing resource to connect to said remote information source 
over said network. 

9. The method recited in claim 1 , wherein the step of storing selection criteria in a memory coupled to a local process- 
ing resource comprises the step of: storing a query in a Web page on a storage device accessible to a browser. 

10. The method recited in claim 9, wherein the step of requesting said information comprises the step of: submitting 
said Web page and said query using said browser to a Web server over said network. 

11. The method recited in claim 10, wherein the step of receiving said information further comprises the step of receiv- 
ing an output Web page at said browser comprising said information from said remote resource. 

12. The method recited in claim 1 1 , wherein the step of providing said information further comprises the step6 of: 

(a) writing a raw mailing list derived from said information on said storage device; and 

(b) formatting said raw mailing fist into a formatted fist according to a format of a database coupled to a mailing 
list manager and coupled to said local processing resource. 

1 3. The method recited in claim 1 2, wherein the step of provkfing said information further comprises the step of irrfxxt- 
ing said formatted list into said database of said mailing Gst manager using an automatic import function of said 
mailing list manager. 

14. Computer apparatus comprising: 

(a) a processor, and 

(b) a memory coupled to said processor; the memory having stored therein an identifier of at least one remote 
information source located on a network coupled to said processor, in which said processor is configured to: 

0) store selection criteria in a memory coupled to a local processing resource; 

(ii) request said information from said remote resource according to said selection criteria; 

(Hi) receive said information from said remote resource over said network; and 

(iv) provide said information to said focal processing resource in a format usable by said local processing 
resource. 
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15. The apparatus recited in claim 14, wherein said processor is configured to connect to said remote resource using 
an application program driven by said local processing resource. 

16. The apparatus recited in claim 14, wherein said processor is configured to connect to said remote resource using 
a browser driven by said local processing resource using a bridge. 

17. The apparatus recited in claim 14, wherein said processor is configured to connect to said remote resource using 
a browser driven by said local processing resource using OLE Automation. 

18. The apparatus recited in claim 14, wherein said processor is configured to: 

(a) store, in said memory, a list of a plurality of remote information sources located on said network and a pro- 
ffle describing a first format of said information at one of said remote information sources; and 

(b) retrieve current versions of said list and said profile from a central data storage location on said network. 

19. The apparatus recited in claim 14, wherein said processor is configured to: 

(a) convert said information from said first format to a second format usable by said local processing resource 
according to said profile; 

(b) invoke an import function of said local processing resource; and 

(c) store said information using said import function, in said second format in a database coupled to said focal 
processing resource. 

20. The apparatus recited in claim 19, wherein said processor is configured to: 

(a) execute a browser; 

(b) command said browser with said local processing resource to connect to said remote information source 
over said network; 

(c) store a query in a Web page on a storage device accessible to a browser; 

(d) submit said Web page and said query using said browser to a Web server over said network; 

(e) receive an output Web page at said browser comprising said i n formation from said remote resource; 

(f) write a raw mailing Dst derived from said information on said storage device; 

(g) format said raw mailing list into a formatted fet according to a format of a database coupled to a mailing list 
manager ad coupled to said local processing resource; and 

(h) import said formatted list into said database of said mailing list manager using an automatic import function 
of said mailing list manager. 
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FIG. IB 
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FIG. 2A 
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FIG. 3 
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FIG, 4A 
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FIG. 4B 
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